for right-censored observations is used to obtain an estimator
of the ratio of scale parameters of two distributions. It is
shown that this estimator is consistent. For small samples,
simulations are performed which show the superiority of the
estimator over the maximum likelihood estimator for exponential
distributions.

1. INTRODUCTION


Point estimation for the squared ratio of the scale parameters associated
with the class of ranklike statistics has been discussed in many elementary
statistics books (for example, Hollander and Wolfe 1973). In this article, we
consider a special situation in which the two underlying distributions with positive
support differ only by their scale parameters. The quantity in which we are
interested is the ratio of these two parameters. A simple and obvious estimator
of this quantity based on two independent samples would be the median
of all possible ratios of the observations in one sample to those in the other
sample. This estimator is essentially a Hodges and Lehmann (1960) estimator (Randles
and Wolfe 1979) through a logarithmic transformation.
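As a small illustration of the uncensored case described above, the following sketch computes the median of all pairwise ratios and checks that it agrees with the exponential of the median of all pairwise log differences. The function names and the sample values are hypothetical, and the ratio is taken as second sample over first sample, which is an assumption about orientation.

```python
import math
from statistics import median

def median_ratio(s, t):
    """Median of all pairwise ratios t_j / s_i for two uncensored samples."""
    return median(tj / si for si in s for tj in t)

def median_log_difference(s, t):
    """Median of all pairwise differences ln(t_j) - ln(s_i)."""
    return median(math.log(tj) - math.log(si) for si in s for tj in t)

# Hypothetical positive samples (3 x 3 = 9 ratios, so the median is one of them).
s = [1.0, 2.0, 4.0]
t = [3.0, 5.0, 16.0]

theta_hat = median_ratio(s, t)           # scale-ratio estimate
delta_hat = median_log_difference(s, t)  # shift estimate on the log scale
```

When the number of pairs is odd the two routes pick out the same pair, so `math.exp(delta_hat)` reproduces `theta_hat` up to rounding. With an even number of pairs, `statistics.median` averages the two middle values, and the two routes can differ slightly.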

Unfortunately, in many situations the observations may be censored or
truncated. For example, the parameter of interest may be the length of survival.
It is common that at the end of the trial there may be incomplete survival
information on certain individuals. A nonparametric estimator of the ratio
of the scale parameters, based on arbitrarily right-censored observations,
is proposed in this article. The proposed estimator, after a logarithmic transformation,
is a minimum Cramér-von Mises distance estimator (Fine 1968, Parr 1980) and
reduces to the Hodges and Lehmann type of estimator mentioned in the previous
paragraph for the uncensored case. The parametric estimation procedure under
arbitrary censorship is generally rather complicated. For example, only
approximate solutions to the likelihood equations may be obtained for two
censored samples from Weibull distributions with the same unknown shape parameter
and different scale parameters.

Specifically, in this paper suppose that two positive random variables
$S^0$ and $T^0$ differ in distribution only by their scale parameters. That is, there
exists a positive constant $\theta$ such that $\theta S^0$ and $T^0$ have the same distribution.
Let $S_1^0, \dots, S_m^0$ be independently distributed as $S^0$ and let $T_1^0, \dots, T_n^0$ be
independent and distributed as $T^0$ and also independent of $S_1^0, \dots, S_m^0$. Furthermore,
it is assumed that $S_i^0$ and $T_j^0$ may be censored from the right by random
variables (or constants) $U_i$ and $V_j$, respectively, $i = 1, \dots, m$, $j = 1, \dots, n$.

We wish to estimate $\theta$ based on the observations $(s_i, \delta_i)$, $i = 1, \dots, m$, and
$(t_j, \epsilon_j)$, $j = 1, \dots, n$, where

$$s_i = \min\{s_i^0, u_i\} \quad\text{and}\quad \delta_i = \begin{cases} 1 & \text{if } s_i = s_i^0, \\ 0 & \text{otherwise,} \end{cases} \qquad i = 1, \dots, m,$$

and

$$t_j = \min\{t_j^0, v_j\} \quad\text{and}\quad \epsilon_j = \begin{cases} 1 & \text{if } t_j = t_j^0, \\ 0 & \text{otherwise,} \end{cases} \qquad j = 1, \dots, n.$$

In the development of the estimator of $\theta$, we consider the transformation

$$x_i^0 = \ln s_i^0, \quad x_i = \ln s_i, \quad u_i' = \ln u_i, \qquad i = 1, \dots, m,$$

$$y_j^0 = \ln t_j^0, \quad y_j = \ln t_j, \quad v_j' = \ln v_j, \qquad j = 1, \dots, n.$$

Then the observations are $(x_i, \delta_i)$, $i = 1, \dots, m$, and $(y_j, \epsilon_j)$, $j = 1, \dots, n$, where

$$\delta_i = \begin{cases} 1 & \text{if } x_i = x_i^0, \\ 0 & \text{otherwise,} \end{cases} \quad\text{and}\quad \epsilon_j = \begin{cases} 1 & \text{if } y_j = y_j^0, \\ 0 & \text{otherwise.} \end{cases}$$

Thus, the problem becomes one of estimating the shift parameter $\Delta = \ln\theta$, based
on $(x_i, \delta_i)$, $i = 1, \dots, m$, and $(y_j, \epsilon_j)$, $j = 1, \dots, n$, where $X_i^0 + \Delta$ has the same
distribution as $Y_j^0$ for each $i$ and $j$. We denote the distribution function of $X_i^0$
by $F$ and that of $Y_j^0$ by $G$ so that $F(y - \Delta) = G(y)$ for all $y$.

The derivation of the proposed estimator of $\theta$ (or $\Delta$) is given in Section 2.
The consistency of the estimator is discussed in Section 3. For small samples,
the comparison of our estimator with the maximum likelihood estimator of the
ratio of the scale parameters of two exponential distributions is made in Section
4 based on the results of computer simulations. The simulations indicate that
our estimator is superior to the maximum likelihood estimator even for this
"nice" parametric case.

2. THE MINIMUM DISTANCE ESTIMATOR OF $\Delta = \ln\theta$


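Without loss of generality we assume that $x_1 \le x_2 \le \dots \le x_m$ and $y_1 \le y_2 \le \dots \le y_n$.
Define the Kaplan-Meier estimators of $F^c = 1 - F$ and $G^c = 1 - G$, respectively,
by (Efron 1967)

$$\hat F_m^c(x) = \begin{cases} 1, & x \le x_1, \\ \prod_{i=1}^{k-1} \left( \dfrac{m-i}{m-i+1} \right)^{\delta_i}, & x \in (x_{k-1}, x_k],\ k = 2, \dots, m, \\ 0, & x > x_m, \end{cases}$$

and

$$\hat G_n^c(y) = \begin{cases} 1, & y \le y_1, \\ \prod_{j=1}^{k-1} \left( \dfrac{n-j}{n-j+1} \right)^{\epsilon_j}, & y \in (y_{k-1}, y_k],\ k = 2, \dots, n, \\ 0, & y > y_n. \end{cases}$$

Also, define the jumps of $\hat F_m^c$ at the $x_i$'s by $a_i$ (if $x_i$ is censored, the
jump is zero), that is,

$$a_i = \begin{cases} \hat F_m^c(x_i) - \hat F_m^c(x_{i+1}), & i = 1, \dots, m-1, \\ \hat F_m^c(x_m), & i = m. \end{cases} \tag{2.1}$$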
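The jumps in (2.1) can be computed in a single pass over the ordered sample. The sketch below is only an illustration under stated assumptions: the function name `km_jumps` is hypothetical, ties among observations are not treated specially, and the largest observation receives all remaining mass whether or not it is censored, in line with the convention that the estimated survival function is zero beyond it.

```python
def km_jumps(values, uncensored):
    """Return the ordered observations and the Kaplan-Meier jumps a_1..a_m.

    values     : observed (possibly censored) values
    uncensored : 1 if the corresponding value is uncensored, 0 if censored
    """
    pairs = sorted(zip(values, uncensored), key=lambda p: p[0])
    m = len(pairs)
    surv = 1.0          # estimated survival just before the current point
    jumps = []
    for i, (_, d) in enumerate(pairs, start=1):
        if i == m:
            # Survival drops to zero after the largest observation,
            # so it carries whatever mass is left.
            jumps.append(surv)
        elif d:
            nxt = surv * (m - i) / (m - i + 1)
            jumps.append(surv - nxt)
            surv = nxt
        else:
            jumps.append(0.0)   # censored point: no jump
    return [p[0] for p in pairs], jumps

# Hypothetical data: the middle observation is censored.
x_sorted, a = km_jumps([1.0, 2.0, 3.0], [1, 0, 1])
```

In this example the censored point gets weight zero and its mass is passed on to the larger observation, giving jumps of 1/3, 0 and 2/3. With no censoring every jump equals 1/m.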


Similarly, let $b_j$ denote the jump of $\hat G_n^c$ at $y_j$ for each $j = 1, \dots, n$.

We define $\hat\Delta_{mn}$ to be a value of $\Delta$ which minimizes the Cramér-von Mises
distance $\int_{-\infty}^{\infty} [\hat F_m(t - \Delta) - \hat G_n(t)]^2 \, dt$, where $\hat F_m = 1 - \hat F_m^c$ and $\hat G_n = 1 - \hat G_n^c$.
Letting $A$ be some real number such that $A > \max\{x_1 + \Delta, \dots, x_m + \Delta, y_1, \dots, y_n\}$
and letting $I_B$ be the indicator function of the set $B$, we can write

$$\int_{-\infty}^{\infty} [\hat F_m(t - \Delta) - \hat G_n(t)]^2 \, dt = \int_{-\infty}^{A} \Big[ \sum_{i=1}^{m} a_i I_{(-\infty,\, t-\Delta]}(x_i) - \sum_{j=1}^{n} b_j I_{(-\infty,\, t]}(y_j) \Big]^2 dt. \tag{2.2}$$

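Squaring and collecting terms, (2.2) equals

$$A \Big[ \sum_{i=1}^{m} \sum_{k=1}^{m} a_i a_k - 2 \sum_{i=1}^{m} \sum_{j=1}^{n} a_i b_j + \sum_{j=1}^{n} \sum_{l=1}^{n} b_j b_l \Big] - \sum_{i=1}^{m} \sum_{k=1}^{m} a_i a_k \max\{x_i + \Delta, x_k + \Delta\} - \sum_{j=1}^{n} \sum_{l=1}^{n} b_j b_l \max\{y_j, y_l\} + 2 \sum_{i=1}^{m} \sum_{j=1}^{n} a_i b_j \max\{x_i + \Delta, y_j\}. \tag{2.3}$$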

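The first term in (2.3) is zero, since $\sum_i a_i = \sum_j b_j = 1$, and the third term does
not involve $\Delta$. Hence, the problem is to minimize over $\Delta$ the expression

$$2 \sum_{i=1}^{m} \sum_{j=1}^{n} a_i b_j \max\{x_i + \Delta, y_j\} - \sum_{i=1}^{m} \sum_{k=1}^{m} a_i a_k \max\{x_i + \Delta, x_k + \Delta\}$$

$$= \sum_{i=1}^{m} \sum_{j=1}^{n} a_i b_j (x_i + y_j) - \sum_{i=1}^{m} \sum_{k=1}^{m} a_i a_k \max\{x_i, x_k\} + \sum_{i=1}^{m} \sum_{j=1}^{n} a_i b_j \, |y_j - x_i - \Delta|,$$

where we have used $2\max\{u, v\} = u + v + |u - v|$ and the fact that the terms linear in $\Delta$ cancel.
Only the last sum depends on $\Delta$. Therefore, we wish to find

$$\min_{\Delta} \sum_{i=1}^{m} \sum_{j=1}^{n} a_i b_j \, |y_j - x_i - \Delta|. \tag{2.4}$$

Since $\sum_{i=1}^{m} \sum_{j=1}^{n} a_i b_j = 1$, it is easy to see that a value $\hat\Delta_{mn}$ of $\Delta$ which
solves (2.4) is a median of the (discrete) probability distribution function
$H_{mn}$ whose probability density function is defined by

$$h_{mn}(v) = \begin{cases} a_i b_j, & v = y_j - x_i,\ i = 1, \dots, m,\ j = 1, \dots, n, \\ 0, & \text{otherwise.} \end{cases} \tag{2.5}$$

Therefore, the corresponding estimator of $\theta$ is $\hat\theta_{mn} = \exp(\hat\Delta_{mn})$.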
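The weighted-median characterization in (2.4) and (2.5) translates directly into a short computation. The sketch below assumes the jumps of the two Kaplan-Meier estimators are already available as lists `a` and `b`. The function name `shift_estimate` is hypothetical, and when the median is not unique the sketch returns the smallest one.

```python
import math

def shift_estimate(x, a, y, b):
    """A median of the distribution placing mass a_i * b_j on y_j - x_i."""
    pairs = sorted(
        (yj - xi, ai * bj)
        for xi, ai in zip(x, a)
        for yj, bj in zip(y, b)
        if ai * bj > 0.0            # censored points carry no mass
    )
    total = sum(w for _, w in pairs)
    cum = 0.0
    for v, w in pairs:
        cum += w
        if cum >= 0.5 * total:      # first point where H_mn reaches 1/2
            return v

# Hypothetical log-scale data; the second x is censored, so its mass
# has moved to the largest x (weights 1/3, 0, 2/3).
x = [0.0, 1.0, 2.0]
a = [1/3, 0.0, 2/3]
y = [1.0, 2.5, 4.5]
b = [1/3, 1/3, 1/3]

delta_hat = shift_estimate(x, a, y, b)
theta_hat = math.exp(delta_hat)     # back to the ratio-of-scales parameter
```

With equal weights 1/m and 1/n the same function returns an ordinary median of the pairwise differences.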

Note that if the two samples $X_1, \dots, X_m$ and $Y_1, \dots, Y_n$ are not censored,
the median of the distribution (2.5) which solves (2.4) is exactly the median of
all the differences $y_j - x_i$, since $a_i = 1/m$ and $b_j = 1/n$ for all $i$ and $j$ (Fine 1968).

It is also interesting to note that $H_{mn}$ can be represented as the
convolution of the two Kaplan-Meier estimators of $F^c$ and $G^c$, that is,

$$H_{mn}(v) = \int_{-\infty}^{\infty} \hat F_m^c(t - v) \, d\hat G_n(t) = - \int_{-\infty}^{\infty} \hat F_m^c(t - v) \, d\hat G_n^c(t).$$

This representation of $H_{mn}$ will be used in the next section to show that $\hat\Delta_{mn}$
is a consistent estimator of $\Delta$.


3.  CONSISTENCY  OF  THE  ESTIMATOR 


For mathematical convenience we assume that the censoring variables
$\{U_i\}$ and $\{V_j\}$ are independent random variables with common distribution functions
$J$ and $K$, respectively. Also, the $U_i$ and $V_j$ are assumed to be independent of
the $X_i^0$ and $Y_j^0$ and independent of each other.

For any real number $v$, it follows from Theorem 8.2 of Efron (1967) that

$$H_{mn}(v) = - \int_{-\infty}^{\infty} \hat F_m^c(t - v) \, d\hat G_n^c(t) \xrightarrow{p} \Pr[X_i^0 + v \ge Y_j^0] = H(v) \tag{3.1}$$

as $m, n \to \infty$ so that $\frac{m}{m+n} \to \lambda$ and $\frac{n}{m+n} \to 1 - \lambda$, $0 < \lambda < 1$, where $H$ is the distribution
function of $Y_j^0 - X_i^0$, which is symmetric about $\Delta$.

We  now  prove  the  following  theorem. 

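THEOREM 3.1 Let $m, n \to \infty$ so that $\frac{m}{m+n} \to \lambda$ and $\frac{n}{m+n} \to 1 - \lambda$, $0 < \lambda < 1$. Then
$\hat\Delta_{mn} \xrightarrow{p} \Delta$.

PROOF. Let $\epsilon > 0$ and $\epsilon' = \epsilon/2$. Since $\Delta$ is the unique median of $H$, we
have $H(\Delta - \epsilon') < \frac{1}{2}$. Since $\hat\Delta_{mn}$ is a median of $H_{mn}$, $H_{mn}(\hat\Delta_{mn} + \epsilon') \ge \frac{1}{2}$.
By (3.1), $H_{mn}(\Delta - \epsilon') \xrightarrow{p} H(\Delta - \epsilon')$ as $m, n \to \infty$. But the event that
$\hat\Delta_{mn} + \epsilon' < \Delta - \epsilon'$ implies the event that $\frac{1}{2} \le H_{mn}(\Delta - \epsilon')$. Thus,
$P(\hat\Delta_{mn} + \epsilon' < \Delta - \epsilon') \le P(\frac{1}{2} \le H_{mn}(\Delta - \epsilon')) \to 0$ as $m, n \to \infty$, which implies that
$P(\hat\Delta_{mn} \ge \Delta - \epsilon) \to 1$.

On the other hand, $H(\Delta + \epsilon) > \frac{1}{2}$ and $H_{mn}(\Delta + \epsilon) \xrightarrow{p} H(\Delta + \epsilon)$ as $m, n \to \infty$
under the conditions of the theorem. Also, $H_{mn}(\hat\Delta_{mn}-) \le \frac{1}{2}$, where the argument
denotes the left limit at $\hat\Delta_{mn}$, and since $\hat\Delta_{mn} > \Delta + \epsilon$
implies that $\frac{1}{2} \ge H_{mn}(\Delta + \epsilon)$, we have $P(\hat\Delta_{mn} > \Delta + \epsilon) \le P(\frac{1}{2} \ge H_{mn}(\Delta + \epsilon)) \to 0$
as $m, n \to \infty$.
Thus,

$$P(\Delta - \epsilon \le \hat\Delta_{mn} \le \Delta + \epsilon) \to 1 \quad \text{as } m, n \to \infty.$$

///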


Therefore, the estimator $\hat\theta_{mn}$ of the ratio of scale parameters for the
original problem is a consistent estimator of $\theta$.


4. SMALL SAMPLE COMPARISONS


In addition to its nonparametric nature, an obvious advantage of the
estimator $\hat\theta_{mn}$ (or $\hat\Delta_{mn}$) is that it is easily computed in closed form, whereas
for arbitrarily right-censored data in many parametric models the scale parameter
cannot be estimated in closed form by maximum likelihood or other procedures.
One parametric case which yields a closed form maximum likelihood estimate of
the ratio of the two scale parameters for censored observations is that of exponential
distributions. In this section we will compare our estimator $\hat\theta_{mn}$ with the mle
for small censored samples from two exponential distributions with
different scale parameters through computer simulations.

In the notation of Section 1, let $S_i^0$, $i = 1, \dots, m$, and $T_j^0$, $j = 1, \dots, n$, be
samples from the exponential densities $f^0(s) = \lambda^{-1} \exp(-s/\lambda) \, I_{[0,\infty)}(s)$ and
$g^0(t) = (\lambda\theta)^{-1} \exp(-t/(\lambda\theta)) \, I_{[0,\infty)}(t)$, respectively. The maximum likelihood
estimator of $\theta$ is found to be

$$\tilde\theta_{mn} = \sum_{j=1}^{n} t_j \sum_{i=1}^{m} \delta_i \Big/ \Big( \sum_{i=1}^{m} s_i \sum_{j=1}^{n} \epsilon_j \Big),$$

which is the ratio of the two estimates of average lifetime from the respective censored
samples.
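For concreteness, the closed-form mle can be evaluated as below. This is a minimal sketch: the function name is hypothetical, and it assumes each sample contains at least one uncensored observation so that neither denominator is zero.

```python
def exponential_mle_ratio(s, delta, t, eps):
    """Ratio of censored-data exponential mean estimates, T sample over S sample.

    Each mean estimate is (total observed time) / (number of uncensored values).
    """
    mean_s = sum(s) / sum(delta)
    mean_t = sum(t) / sum(eps)
    return mean_t / mean_s

# Hypothetical censored samples (1 = uncensored, 0 = censored).
s, delta = [1.0, 2.0, 3.0], [1, 1, 0]
t, eps = [4.0, 8.0], [1, 1]

theta_mle = exponential_mle_ratio(s, delta, t, eps)
```

Here the estimated mean lifetimes are 6/2 = 3 and 12/2 = 6, so the estimated ratio is 2.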

In order to compare the mle $\tilde\theta_{mn}$ with our estimator $\hat\theta_{mn}$ of $\theta$, we have
performed Monte Carlo simulations to indicate the bias and mean squared error
of each estimator. Taking $\lambda = 1$ in $f^0$ and $g^0$ for all the computations, the
censoring variables $U_i$, $i = 1, \dots, m$, and $V_j$, $j = 1, \dots, n$, described in Section 1 were
taken to be independent uniformly distributed random variables on the intervals
$[0, s_\xi]$ and $[0, t_\eta]$, respectively, where $s_\xi$ was the $100\xi$th percentile of $f^0$ and
$t_\eta$ was the $100\eta$th percentile of $g^0$ ($0 < \xi, \eta < 1$). For $\xi = 0.9$, for example,
slightly more than 10% of the $S_i^0$'s will be censored.
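One way to generate a censored sample under this design is sketched below, using only the Python standard library. The function names, the seed and the sample size are hypothetical choices for illustration and are not taken from the original simulation code.

```python
import math
import random

def exponential_percentile(q, scale=1.0):
    """100q-th percentile of an exponential distribution with the given mean."""
    return -scale * math.log(1.0 - q)

def censored_exponential_sample(size, scale, q, rng):
    """Draw exponential lifetimes censored by uniforms on [0, 100q-th percentile].

    Returns the observed values and indicators (1 = uncensored, 0 = censored).
    """
    upper = exponential_percentile(q, scale)
    observed, uncensored = [], []
    for _ in range(size):
        latent = rng.expovariate(1.0 / scale)   # true lifetime, mean = scale
        cens = rng.uniform(0.0, upper)          # censoring time
        observed.append(min(latent, cens))
        uncensored.append(1 if latent <= cens else 0)
    return observed, uncensored

rng = random.Random(0)                          # fixed seed, arbitrary
s_obs, s_flags = censored_exponential_sample(10, 1.0, 0.9, rng)
```

For the second sample the same function would be called with `scale` set to the product of the common scale and the ratio parameter, and with the second censoring percentile in place of `q`.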

For each fixed set of values for $m$, $n$, $\theta$, $\xi$, and $\eta$, 1000 samples
$(s_1, \delta_1), \dots, (s_m, \delta_m)$ and $(t_1, \epsilon_1), \dots, (t_n, \epsilon_n)$ were generated, and $\hat\theta_{mn}$ and the mle $\tilde\theta_{mn}$
were calculated for each. The average and estimated mean squared errors (mse)
were calculated for the 1000 repetitions. The resulting estimated biases and
mean squared errors for several cases are reported in Table 1.

TABLE 1. Small Sample Comparison for Exponential Distributions

                              $\hat\theta_{mn}$            mle
  n     θ      ξ      η       bias     mse       bias      mse

 10    0.5    0.90   0.50    -0.073   0.182     0.313     0.851
              0.75   0.75     0.189   0.345     0.319     1.058
              0.50   0.50     0.174   0.226     0.252     0.693

 10    0.5    0.90   0.90     0.080   0.133     0.123     0.245
              0.90   0.50    -0.163   0.069     0.227     0.476
              0.75   0.75     0.060   0.111     0.181     0.487
              0.50   0.50     0.024   0.039     0.252     0.693

 10    1.0    0.90   0.90     0.160   0.531     0.245     0.978
              0.90   0.50    -0.326   0.276     0.454     1.903
              0.75   0.75     0.122   0.445     0.363     1.947
              0.50   0.50     0.048   0.157     0.505     2.772

 10    2.0    0.90   0.90     0.320   2.123     0.490     3.914
              0.90   0.50    -0.653   1.104     0.909     7.613
              0.75   0.75     0.242   1.780     0.726     7.787
              0.50   0.50     0.097   0.628     1.009    11.087

 15    0.5    0.90   0.90     0.052   0.067     0.081     0.102
              0.90   0.50    -0.162   0.051     0.185     0.292
              0.75   0.75     0.038   0.047     0.094     0.129
              0.50   0.50     0.016   0.017     0.171     0.327

 15    2.0    0.90   0.90     0.209   1.069     0.325     1.635
              0.90   0.50    -0.647   0.810     0.741     4.667

 10    2.0    0.90   0.90     0.119   1.035     0.324     2.223
              0.90   0.50    -0.750   0.872     0.672     4.366

These simulations indicate some clear patterns in the behavior of $\hat\theta_{mn}$.
As $m$ and $n$ increase for fixed $\theta$, $\xi$, and $\eta$, the bias and mean squared error
tend to decrease for both the mle and $\hat\theta_{mn}$ (since these estimators are consistent).
As the censoring in the two samples becomes more severe for fixed $m$, $n$, and $\theta$,
the bias and mse tend to decrease for $\hat\theta_{mn}$ and increase for the mle. Also,
for fixed $m$, $n$, $\xi$, and $\eta$, the bias and mse of both estimators tend to increase
as $\theta$ increases, more so for the mle than for $\hat\theta_{mn}$. Therefore, even for the
exponential distributions, when both estimates are easily calculated from small
samples, our estimator is superior to the maximum likelihood estimator
in performance: it has the smaller estimated mse in every case reported in Table 1.


5.  SUMMARY  AND  CONCLUSIONS 

An estimator $\hat\theta_{mn}$ of the ratio of scale parameters has been developed for
the two sample problem when the observations are arbitrarily censored from the
right. The estimator was obtained by logarithmically transforming the problem
to one of estimating a shift parameter by minimizing a Cramér-von Mises
measure of the distance between two Kaplan-Meier estimates of the underlying
distribution functions. This approach was taken since the minimum distance
estimation does not seem to yield a closed form solution for the original scale
problem.

The estimator $\hat\theta_{mn}$ is shown to be consistent, and Monte Carlo simulation
results for two exponential distributions with different scale parameters
indicate that the estimator is superior in performance to the maximum likelihood
estimator for that case.